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PHOSPHAT IDYLINOSITOL LINKING OF PROTEINS 

The present invention relates to glycosyl 
phosphatidylinositol ("PI") linkage structures, which 
in vivo covalently anchor certain proteins to the 
surface of the cells in which they are produced. More 
particularly, the present invention relates to the 
isolation of DNA sequences coding for the 
phosphatidylinositol linkage signalling sequence of 
lymphocyte function-associated antigen 3 ("LFA-3") , and 
to processes for providing Pi linkage structures to 
extracellular proteins (or the extracellular portion of 
membrane proteins) , thereby giving such proteins a 
Pl-linked form, with resulting advantages such as 
releasable plasma membrane binding, enhanced 
purifiability, the capability of micelle formation, 
etc. The invention also relates to novel chimeric 
polypeptides having Pi linkage structures, hybrid DNA 
sequences encoding such polypeptides, and to products, 
methods and compositions made possible by the 
discoveries detailed herein. 

BACKGROUND OF THE INVENTION 

Proteins produced by expression of cellular 
DNA may be divided into three general groups: 
cytoplasmic proteins, which remain wholly within the 
cell; extracellular or "soluble" proteins, which are 
secreted outside the cell; and membrane proteins, which 
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become attached to the phospholipid bilayer of the cell 
membrane. 

Different membrane proteins play major roles 
in a wide range of cell functions, such as anchoring 
5 cytoskeletal components, mediating cell-to-cell 

adhesion, transporting molecules into and out of the 
cell, receiving signals from hormones and other 
chemical transmitters, and many others. 

Membrane proteins are bound to the plasma 

10 membrane in a variety of ways. Peripheral or 

"extrinsic" proteins do not penetrate the phospholipid 
bilayef fiut instead interact with the polar head groups 
on one surface of the bilayer or interact with other 
membrane proteins anchored directly to the bilayer. 

15 Integral or "intrinsic" proteins have a region which 
interacts with the hydrophobic core of the bilayer. 
Transmembrane proteins, such as cell surface antigens, 
celliilar receptors, adhesion molecules and transport 
proteins,' traverse the bilayer one or more times, 

20 exposing both cytoplasmic and extracellular regions or 
domains * 

A further possibility for attachment to the 
plasma membrane is via covalent linkages between the 
lipid bilayer and the protein. In particular, a number 

25 of cell surface proteins are anchored to the cell 
membrane ttprough a C-terminal, covalently attached 
glycosylated phosphatidy linos itol moiety. See, e.g., 
Low et al., "Covalently Attached Phosphatidyl inositol 
As A Hydrophobic Anchor For Membrane Protein", Trends 

30 Biol. Sci. . 11, pp. 212-15 (1986) . The glycosyl 
phosphatidylinositol linkage, or "PI" linkage, is 
believed to be the anchoring structure for more than 
two dozen specific membrane proteins, found in a wide 
distribution of species and cell types. See Ferguson 

35 et dl., "Cell Surface Anchoring Of Proteins Via 
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Glycosyl-Phosphatidylinositol Structures", Ann. Rev. 
Biochem. , 57, pp. 285-320 (1988) (incorporated herein 
by reference). The biological functions of these 
Pi-linked proteins are also highly diverse, including 
5 surface hydrolases, coat proteins, surface antigens and 
adhesion molecules. 

The C-terminal amino acid residue of the 
Pi-linked protein is attached to the membrane 
phosphatidylinositol moiety via an ethanolamine- 

10 phosphodiester-glycan bridge. Ferguson et al. , 

"Glycosyl-Phosphatidylinositol Moiety That Anchors 
Trypanosoma brucei Variant Surface Glycoprotein To The 
Membrane", Science, 239, pp. 753-59 (1988); Low, 
"Biochemistry Of The Glycosyl-Phosphatidylinositol 

15 Membrane Protein Anchors", J. Biochem. , 244 . pp. 1-13 
(1987) (incorporated herein by reference) . The 
mechanism of attachment is thought to involve 
processing of a c-terminal hydrophobic sequence present 
on the precursor protein but eliminated prior to the PI 

20 anchoring of the mature protein. Ferguson et al., 
supra, Ann. Rev. Biochem. f 57 . at 301-04. The 
processed C-terminal segments are believed to be a 
signal for phosphatidylinositol attachment. In one 
study, DNA coding for the 37-amino acid C-terminal 

25 sequence of the Pi-linked protein, decay accelerating 
factor (DAF) , was fused to the 3 • end of DNA coding for 
a normally secreted protein fragment of glycoprotein D 
(from herpes simplex virus-1) , resulting in a Pi-linked 
fusion protein. See Caras et al., "Signal For 

30 Attachment Of A Phospholipid Membrane Anchor In Decay 
Accelerating Factor", Science . 238 r pp. 1280-83 (1987). 
However, comparison of the C-terminal sequences of many 
precursors of Pi-linked proteins has failed to reveal a 
consensus PI linkage signalling sequence. See, Low, 
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supra . Figure 3; Ferguson et al., supra , Ann. Rev, 
Biochem. r 57 f Table 3. 

Although there has been a considerable amount 
of investigation of phosphatidylinositol-linked 
5 proteins and the phosphat idy linos itol linkage, there is 
a need for further characterization of specific PI 
linkages and for further investigation into the signal 
and mechanism for PI attachment to the cellular 
membrane r as well as the selective release of membrane 

10 bound proteins. These needs are addressed by the 

present invention, relating to the PI linkage structure 
of a Pi-linked form of lymphocyte function-associated 
antigen 3 ( M LFA-3 W ) , and to applications of PI linkage 
signalling sequences derived from Pi-linked LFA-3. 

15 The PI linkage signalling sequences of the 

present invention, when linked in frame to, the 3* end 
of DNA coding for a secreted polypeptide, or the 
secreted portion of a polypeptide, result on expression 
in a Pi-linked form of the polypeptide. Using the DNA 

20 sequences and methods herein, novel chimeric proteins 
bearing a C-terminal phosphatidylinositol structure and 
numerous applications for such products are made 
possible. 

SUMMARY OF THE INVENTION 

25 Accordingly, it is an object of the present 

invention to provide DNA sequences coding for a 
phosphatidylinositol linkage signalling sequence. 

It is a further object of the present 
invention to provide DNA sequences coding on expression 

30 far novel polypeptides having a C-terminal 
phosphatidylinositol linkage structure. 

It is a further object of the present 
invention to provide novel Pi-linked polypeptides and 
to provide a means for synthesizing a Pi-linked form of 
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soluble proteins or of the extracellular domains of 
integral membrane proteins. 

It is a further object of the present 
invention to provide micelles of novel polypeptides 
bearing intact phosphatidylinositol linkage structures 
and to provide liposomes having surface characteristics 
determined by Pi-linked polypeptides according to this 
invention. 

It is a further object of the present 
invention to provide a means for altering the surface 
protein complement of cell cultures and to provide host 
cells capable of expressing Pi-linked forms of 
exogenous proteins or polypeptides. 

It is a further object of this invention to 
provide methods for directing cell-to-cell adhesion or 
targeting specific cells, for screening cells, and for 
directing the action or increasing the specificity of 
proteins and therapeutic agents. 

These and other objects which will be 
apparent from the following description are 
accomplished herein by the discovery of DNA seguences 
coding for a lymphocyte function-associated antigen 3 
phosphatidylinositol linkage signalling sequence, and 
DNA sequences which code on expression for polypeptides 
coded for on expression by such DNA sequences. 

BRIEF DESCRIP TION OF THE DRAWINGS 

Figure l depicts a comparison of the DNA 
insert carried in phage AP24 (and the deduced amino 
acid sequence) , which contains a coding region for 
Pi-linked LFA-3, and the DNA insert carried in phage 
AHT16 (and the deduced amino acid sequence) , which 
contains a coding region for the transmembrane form of 
LFA-3 . 
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Figure 2 depicts the DNA coding sequence and 
the deduced amino acid sequence of Pl-linked LFA-3, 
with underscoring indicating a phosphatidyl inositol 
linkage signalling sequence. 
5 Figure 3 depicts the amino acid sequence of 

natural CD4 protein, a surface antigen on T-cells. The 
putative signal sequence and the transmembrane region 
are indicated by underscoring. 

Figure 4 depicts the DNA coding sequence 
10 contained? in plasmid T4 /LFA-3 /AD and coding for a 

i fusion protein characterized by an extracellular domain 
of Recombinant CD4 protein and a terminal sequence 
corresponding to the 28 C-terminal amino acids of 
, Pl-linked LFAr3. The deduced amino acid sequence of 
15 ,the chimeric protein is also indicated. 

figure 5 depicts the DNA coding sequence 
contained in plasmid T4 /LFA-3/ 2 and coding for a 
chimeric protein characterized by an extracellular 
domain of recombinant CD4 protein and a terminal 
20 sequence believed to represent a PI linkage signalling 
sequence Recording to the invention. The deduced amino 
ajsid seqpence of the chimeric protein is also 
indicated. The fusion protein resulting from 
expression of this coding sequence in a transformed 
25 host is 6%pected to yield a Pl-linked form of CD4. 

Figure 6 is a graphic representation of an 
expeariinent to show that Clone #11 of CHO cells 
transf acted with the hybrid DNA sequence T4 /LFA-3/ 2 
accordin^to the invention produced a Pl-linked CD4 
30 protein, Jreleasable from the cell surface by incubation 
with PIPLp. 

* DETAILED DESCRIPTION OF THE INVENTION 

It has been discovered herein that a portion 
of the 3» end of cDNA coding for Pl-linked LFA-3 can be 
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isolated and linked, in turn, to the 3* end of a DNA 
sequence coding for a secreted polypeptide (or a 
secreted portion of a polypeptide) to confer PI linkage 
to the polypeptide. The 3" portion selected from the 
DNA coding for Pi-linked LFA-3 which is able to confer 
PI linkage to a secreted polypeptide is referred to 
herein as a "phosphatidylinositol linkage signalling 
sequence." Preferably, the phosphatidylinositol 
linkage signalling sequences are derived from DNA 
coding for the Pi-linked form of lymphocyte 
function-associated antigen 3 ("LFA-3") as described 
herein . 

LFA-3 is a surface glycoprotein found on many 
cells , e.g., erythrocytes , monocytes , granuloctyes , 
cytotoxic T-lymphocytes , B-lymphoblastoid cells, smooth 
muscle cells, endothelial ceils, fibroblasts. Springer 
et al., "The Lymphocyte Function-Associated LFA-1, CD2, 
and LFA-3 Molecules: Cell Adhesion Receptors Of The 
Immune System", Ann. Rev. Immunol . r 5, pp. 223-52 
(1987) . LFA-3 binds to CD2 (a T- lymphocyte accessory 
molecule) and is believed to mediate adhesion of 
T-lymphocytes to target cells. This adhesion is 
essential to initiation of the T-lymphocyte functional 
response. Dustin et al., "Purified Lymphocyte 
Function-Associated Antigen 3 Binds To CD2 And Mediates 
T Lymphocyte Adhesion", J. Exp. Med. r 165 . pp. 677-92 
(1987) . 

LFA-3 occurs in two distinct cell surface 
forms: a transmembrane form and a Pl-linked form. 
Dustin et al., "Anchoring Mechanisms For LFA-3 Cell 
Adhesion Glycoprotein At Membrane Surface", Nature , 
329, pp. 846-48 (1987) . Comparison of cDNA sequences 
encoding the precursor polypeptides of both forms has 
revealed that the 5« ends of the sequences are 
identical but that the 3' ends of the sequences differ. 
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See commonly assigned , copending U.S. patent 
application Ser. No, 237 , 309 , filed August 26, 1988 
(incorporated herein by reference) and Figure 1 herein. 
The HT16 cDNA coding for the transmembrane form of 
5 LFA-3 contains a 3 1 segment (HT16 Nucleotides 655 
through 723 , Figure 1) coding for a putative 
transmembrane region of 23 hydrophobic or uncharged 
amino acids and a 3 1 segment (HT16 Nucleotides 724 
through 759, Figure 1) coding for a putative 

10 cytoplasmic tail of 12 amino acids. The P24 cDNA 

coding for the Pi-linked form of LFA-3 is shorter at 
the 3 1 end by thirty nucleotides, with 100% homology 
with the transmembrane form until the nucleotides 
coding for the last four amino acids (P24 Nucleotides 

15 718 through 729, Figure 1). Studies with transmembrane 
LFA-3 have demonstrated that removal of the 12 -amino 
acid cytoplasmic domain converts the protein into a 
Pi-linked protein. Therefore, it is theorized that the 
functional signal for phosphatidylinositol attachment 

20 is contained within the carboxy-terminal sequence of 
LFA-3 . 

The phosphatidylinositol linkage signalling 
sequences according to the present invention may be 
isolated from DNA coding for Pi-linked LFA-3 or 

25 synthesized directly using standard techniques. 

Preferred phosphatidylinositol linkage 
signalling sequences according to the present invention 
are derived from DNA coding for Pi-linked LFA-3. Most 
preferably, the phosphatidylinositol linkage signalling 

30 sequence comprises the following: 
5 1 -AGCAATGCATTATTTAATACAACATCATCA^ 
ATCCCAAGCAGCGGTCATTCAAGACACAGATA 

TTAGCAGTAATTACAACATGTATTGTGCTGTATATGAATGGTATGTATGCT 
TTT-3 ' . 



The ability to attach a phosphatidylinositol 
structure according to this invention to an otherwise 
soluble protein or polypeptide leads to a wide variety 
of uses, some of which are discussed below: 

5 I. Chimeric Polypeptides 

The phosphatidylinositol linkage signalling 
sequences of the present invention may be used to 
construct hybrid DNA sequences coding for "chimeric" 
polypeptides having a PI linkage. As used herein, 

10 "chimeric" proteins or polypeptides will refer to 

proteins or polypeptides endowed with a PI structure in 
accordance with the teachings herein. A "chimeric 
protein" therefore will typically have a protein 
component (e.g., corresponding to a soluble protein) 

15 and a phosphatidylinositol component. Chimeric 

proteins are preferably the result of expression of the 
hybrid DNA molecules described herein. Such hybrid DNA 
molecules are advantageously prepared by linking the 5' 
end of a phosphatidylinositol linkage signalling 

20 sequence in the correct reading frame to the 3« end of 
a DNA segment coding for a secretory polypeptide or the 
secreted portion of a polypeptide. On expression in an 
appropriate host cell, a chimeric polypeptide or fusion 
protein is produced having an extracellular domain and 

25 a PI linkage anchoring it to the host cell membrane. 

Suitable polypeptides for attachment of 
phosphatidylinositol structures according to this 
invention may be any polypeptide or fragment thereof 
which is normally secreted from the cell in which it is 

30 produced, or which can be made to be secreted by the 
attachment of a secretory signal sequence to the 
N-terminus. Suitable polypeptides will also include 
the extracellular portions of membrane-bound proteins, 
for example, where it is desired to produce a Pi-linked 
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form of a protein normally anchored to the cell surface 
via a transmembrane region and/or a cytoplasmic domain. 
Suitable types of polypeptides which may be 
advantageously provided with a phosphatidylinositol 
5 structure include cell surface ligands, antibodies to 
cell surface antigens, hormones, cytotoxins, cell 
activation ligands, soluble receptors, tumor markers, 
etc. In a particularly preferred embodiment, DNA 
encoding a truncated, soluble form of CD4 (also known 

10 as n T4"} f ;va surface protein on helper T cells, is fused 
in frame with an LFA-3 phosphatidylinositol linkage 
signalling sequence according to this invention, to 
provide a novel chimeric protein having the CD4 
receptor and a C-terminal phosphatidylinositol 

15 structure. (See Figure 3 herein.) DNA sequences 

coding for this fusion protein can be used to transform 
host cells, and culturing such hosts will result in a 
population of cells bearing Pi-linked CD 4 on the host 
surface membranes. 

20 Since CD4 is now known to be a receptor to 

which the HIV virus (responsible for acquired immune 
deficiency syndrome) attaches in the process of 
infection, the chimeric CD4 /phosphatidylinositol 
protein prepared as described herein offers a potential 

25 treatment for HIV infection: the CD 4 receptor can be 
used to target the HIV virus; and through the PI 
linkage, the CD4 receptor can be expressed on cells 
which could destroy the virus. In like manner, the CD4 
receptor could be fused to a protein toxic to HIV 

30 infected cells (or other virus infected cells) and the 
PI structure employed in the formation of micelles or 
liposomes (discussed infra ) to provide a high 
concentration drug delivery system. 

:PJ-linked polypeptides are obtained using the 

35 hybrid DNA molecules according to the invention by 
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expressing the hybrid DNA in a host capable of forming 
the phosphatidylinositol linkage. Any eukaryotic cell 
capable of forming a phosphatidylinositol linkage may 
be employed. The selection of host cell will be made 
on the basis of additional factors, such as stability 
of cell line, ease of culturing, compatability with 
other cells or organisms, cytotoxicity toward other 
cell types, and many other factors, depending on the 
final application envisioned for the Pi-linked 
polypeptide produced by the transformation, or the 
application envisioned for the transformed cells 
themselves. Judicious selection of prospective host 
organisms will allow those skilled in the art to tailor 
the features of this invention to their diverse needs, 
while still relying on the principles described herein. 

A wide variety of unicellular host cells and 
cell lines are suitable, including, for example, 
mammalian cell cultures, e.g., CHO, Rl.i, b-W, and L-M 
cells, African green monkey cells (including COSl, 
C0S7, BSCl, BSC40, and BMT10 cells), and human cells 
(including erythrocytes, monocytes, granuloctyes, 
cytotoxic T-lymphocytes, B-lymphoblastoid cells, smooth 
muscle cells, endothelial cells, fibroblasts, etc.), 
and cell lines developed from the foregoing. 

The hybrid DNA molecules are expressed by 
operatively linking them to an expression control 
sequence in an appropriate expression vector, which in 
turn is used to transform an appropriate unicellular 
host. The operative linking of a hybrid DNA sequence 
of this invention to an expression control sequence 
includes the provision of a translation start signal in 
the correct reading frame upstream of the DNA sequence. 

Many host-expression vector combinations may 
be employed in expressing the DNA sequences of this 
invention. Useful expression vectors may be 
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constructed from segments of chromosomal, 
non-chromosomal or synthetic DNA. Suitable vectors 
include, for example, various known derivatives of SV40 
and known bacterial plasmids, e.g., plasmids from 
5 E.coli including col El, pCRl, pBR322, pMB9 and their 
derivatives, wider host range plasmids, e.g., RP4, 
phage DNAs (e.g., numerous derivatives of phage A, such 
as NM989, and other DNA phages, such as M13 and 
Filamenteous single stranded DNA phages) , yeast 

10 plasmids such as the 2fi plasmid or derivatives thereof, 
and vectors derived from combinations of plasmids and 
phage DNAs, such as plasmids which have been modified 
to employ phage DNA or other expression control 
sequences. For animal cell expression, plasmid BG312, 

15 a plasmid containing the major late promoter of 
adent>Vdbtus 2, is preferred. 

■ " In addition, any of a wide variety of 

expression control sequences — sequences that control 
the expression of a DNA sequence when operatively 

20 linked to it — may be used in these vectors to express 
the DNA sequence of this invention. Such useful 
expression control sequences include, for example, the 
early and late promoters of SV40 or the adenovirus, the 
lac system, the trp system, the TAC or TRC system, the 

25 majdr operator and promoter regions of phage A, the 
control regions of fd coat protein, the promoter for 
3-phosphdglycerate kinase or other glycolytic enzymes, 
the promoters of acid phosphatase, e.g., Pho5, the 
promoters of the yeast a-mating factors, and other 

30 sequences known to control the expression of genes of 
prokarydtic or eukaryotic cells or their viruses, and 
various combinations thereof. For animal cell 
expression, it is preferred to use an expression 
control sequence derived from the major late promoter 

35 of adenovirus 2. 
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It should of course be understood that not 
all vectors and expression control sequences will 
function equally well to express the DNA sequences of 
this invention. Neither will all hosts function 
5 equally well with the same expression system. However, 
one of skill in the art may make a selection among 
these vectors, expression control sequences, and hosts 
without undue experimentation and without departing 
from the scope of this invention. For example, in 

10 selecting a vector, the host must be considered because 
the vector must replicate in it. The vector's copy 
number, the ability to control that copy number, and 
the expression of any other proteins encoded by the 
vector, such as antibiotic markers, should also be 

15 considered. 

In selecting an expression control sequence, 
a variety of factors should also be considered. These 
include, for example, the relative strength of the 
system, its controllability, and its compatibility with 

20 the particular DNA sequence, of this invention, 

particularly as regards potential secondary structures. 
Unicellular hosts should be selected by consideration 
of their compatibility with the chosen vector, the 
toxicity of the product coded on expression by the DNA 

25 sequences of this invention to them, their secretion 
characteristics, their ability to fold proteins 
correctly, their stability and culturing requirements, 
and the ease of purification of the products coded on 
expression by the DNA sequences of this invention. 

30 Within these parameters one of skill in the 

art may select various vector/expression control 
system/host combinations that will express the DNA 
sequences of this invention. 

Various agents are known to disrupt the 

35 phosphatidylinositol linkage and release the Pi-linked 
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protein from the cell surface. These include, for 
example, nitrous acid, phosphatidylinositol-specif ic 
phospholipases C ("PIPLC 11 ), phosphatidylinositol- 
specif ic phospholipases D, and possibly 
5 endoglycosidase, phosophodiesterase and proteinase 
(Low, supra ) . PIPLC, e.g., from Tr ypanosoma brucei, 
has been the agent most extensively studied. PIPLC 
irreversibly cleaves the PI linkage of Pi-linked 
proteins with a high degree of specificity, creating a 

10 soluble protein. Thus, through the use of PIPLC or 

other agents which cleave the PI linkage, the Pi-linked 
chimeric polypeptides according to the invention may be 
selectively released from the surface of the host cells 
in which they are produced. 

15 The invention, therefore, provides a method 

for obtaining recombinant proteins in soluble form by 
culturing a host transformed with a hybrid DNA molecule 
to yield Pl-anehored recombinant proteins, then 
introducing PIPLC or other agent to the culture medium, 

20 causing release of the protein in soluble form. Also 
contemplated are methods for purifying proteins which 
are difficult to purify by taking advantage of the 
releasable PI attachment to the cells in which they are 
produced. For instance, the host cells can be 

25 immobilized on a substrate by means of a surface 

receptor unrelated to the Pi-linked protein, and then 
the protein can be eluted in soluble 
form using PIPLC or another phosphatidylinositol 
linkage-disrupting agent as an eluant. 

30 II. Micelles 

Chimeric proteins prepared according to the 
present invention and thus endowed with a terminal 
phosphatidylinositol structure can be purified with the 
phosphatidylinositol structure intact by solubilization 
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of the membrane with detergents, in detergent 
solution, the chimeric proteins will generally be 
monomeric and evenly dispersed; however, removal of 
detergent by dialysis will cause aggregation of the 
5 phosphatidylinositol moieties to form micelles or 
liposomes. 

Micelle formation can be induced and 
controlled by the addition of phospholipids to achieve 
a specific ratio of protein to phospholipid. The 

10 orientation of the chimeric protein components of the 
micelles can be controlled also, so that the micelles 
have an outer surface which is predominantly composed 
of the phospholipid moieties or predominantly composed 
of the protein moieties of the chimeric proteins. The 

15 size of the micelles may also be controlled by varying 
the detergent employed, the nature of the added 
phospholipid, or the phospholipid/protein ratio. 

Generally, the size of liposomes directly 
affects the rate at which they are cleared from the 

20 bloodstream. For example, smaller liposomes and 

negatively charged liposomes appear to be more stable 
and accumulate in the spleen and liver. Thus, the 
micelles and liposomes prepared from chimeric proteins 
according to the present invention can be tailored to 

25 remain in the bloodstream for a desired period and to 
be delivered to specific organs. For example, small 
micelles according to the invention can be formed with 
an outer surface exhibiting a predominantly negative 
charge from the PI moiety, and such micelles would be 

0 ideal for delivering a particular protein moiety to the 
spleen or liver. 

The chimeric protein micelles can also be 
structured so that the protein moiety is located on the 
outside. This type of structure leads to a high 
5 concentration of protein activity in the 
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micxoenvironment surrounding the micelle. For example , 
depending on the nature of the protein component of the 
micellular chimeric protein, the micelle can function 
as a multivalent ligand or receptor, or as a unitary 
5 protein having multiplied activity. Depending on the 
characteristics of the ligand-receptor interactions, 
the close proximity of, e.g., multiple ligands on the 
surface of a micelle according to the invention, could 
confer a several-fold higher affinity (due to a lower 

10 on-off rate) for the ligand than that found for the 
monomer ic ligand protein. 

Many therapeutics are only effective when 
present in higher concentrations and must be injected 
in large doses to account for dilution and clearance 

15 effects. Mixed micelles can be engineered for drug 
targeting by including a second or third protein (to 
whiqh the PI moiety has been attached) which 
specifically localizes the micelles to the target 
tissue or > organ. For example, formation of a mixed 

20 micelle having chimeric ligand components and chimeric 
cytotoxic components, the former having an affinity for 
the surf ace receptor of a specific pathogen and the 
latter being toxic to the pathogen, produces a drug 
specifically targeted to the pathogen and also produces 

25 a therapeutic which localizes the effect of the 

cytotoxic component. The specific localization will 
assure a very high concentration of the drug at the 
site of interaction. 

Proteins to be included in micelles can be 

30 cell targeting proteins (cell surface ligands, tumor 
markers, antibodies to tumor surface antigens, etc.), 
hormones, ^toxins, cell activation ligands, soluble 
receptors (e.g., CD4) or any other peptides or protein 
drugs. 



rUl/ UO7U/V1037 



- 17 - 

Another approach to a mixed micelle is 
through the construction of a fusion protein to which 
the PI structure is attached. The fusion protein will 
exert plural functions, i.e., will exhibit plural 
5 domains corresponding to the protein fragments spliced 
together to form the fusion protein component. One 
part of such a fusion protein micelle component may act 
as a targeting protein (e.g., ligand of a specific 
receptor) while the other part of the protein may have 
10 a specific therapeutic function (e.g. , a toxin that 
lyses specific cells; a cellular activator, a hormone, 
etc . ) . 

III. Cell Targeting 

The chimeric proteins according to this 
15 invention can be expressed in specific host cells and 
used to target that cell to specific tissues or organs. 

As outlined above for the therapeutic 
applications of micelles, human cells with specific 
functions (killer cells, helper T-cells, etc.) can be 
20 engineered to be targeted to a specific site for 

action. Cytotoxic T- lymphocytes, natural killer cells, 
macrophages or other types of cells can be transfected 
with a Pi-linked soluble receptor or ligand specific to 
the target cell (e.g., tumor cell marker transfected 
25 into cytotoxic T-lymphocytes) . Such targeted cells may 
advantageously be cultured and specifically activated 
in vitro . 

Most cell surface proteins are transmembrane 
proteins having a cytoplasmic domain. The cytoplasmic 
30 domain often transduces a cellular signal upon 

interaction with its extracellular ligand. However, by 
replacing the cytoplasmic domain with the PI linkage it 
is possible to attach the extracellular portion of the 
protein without its cytoplasmic signalling seguence and 
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thx^s use that molecule strictly as a targeting 
molecule. Cell targeting in this fashion provides a 
number of cytolytic cells for site-specific action 
which normally do not show any specificity. 

5 IV. Screening DNA Libraries 

The sequence coding for the PI linkage can be 
incorporated in an expression vector and used to 
construct and screen a cDNA library for proteins which 
are normally secreted. 

10 The phosphatidylinositol linkage signalling 

sequences of the present invention may be incorporated 
in expression vectors and used to construct and screen 
a cDNA library. In this feature , a PI 
linkage-conferring sequence is included in the 

15 expression vector such that the cloned DNA coding for a 
normally ; secreted protein is anchored to the host cell 
surface on expression. The cells harboring the cloned 
DN£ can then be separated, e.g., by the panning 
technique described by Seed and Aruffo, "Molecular 

20 Cloning Of The CD2 Antigen, The T-Cell Erythrocyte 

Receptor , By A Rapid Immunoselection Procedure", Proc. 
Natl. Acad. Sci, USA r 84/ PP- 3365-69 (1987) 
(incorporated herein by reference) . 

The cDNA library is preferably constructed 

25 usi^g an animal cell expression vector which has been 
engineered so that each protein expressed has the PI 
linkage. The otherwise secreted proteins will thus be 
attached to the cell surface of the host cell, and 
cells expressing a particular protein of interest can 

30 be isolated using a monoclonal antibody to that 

protein. The isolation of cDNA for a secreted protein 
with monoclonal antibodies or a ligand for the protein 
by currently established techniques is very laborious 
and inefficient. The proposed technique therefore 
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represents a great improvement in efficiency and 
reliability. 

The following examples are provided in order 
to specifically illustrate the features of the present 
5 invention. Although the following description relates 
to a particularly preferred chimeric protein based on 
the extracellular domain of CD4, the examples are not 
intended to limit in any way the scope of the 
invention. 

10 EXAMPLES 

The following describes experiments to 
produce a Pi-linked form of CD4 protein by attaching a 
terminal segment derived from DNA coding for Pi-linked 
LFA-3 to DNA coding for a portion of the extracellular 

15 domain of CD4 , which is normally (i.e., in wild type 
human T-cells) a membrane protein. CD4 is a 
glycoprotein of 458 amino acids, including a 23-amino 
acid signal sequence (AA^ through AA 23 ) , a 373-amino 
acid extracellular domain (AA 24 through AA 3gg ), a 

20 transmembrane domain (AA 39? through AA 418 ), and a 

cytoplasmic tail (AA 41g through AA 458 ) . The amino acid 
sequence for CD4 is shown in Figure 3. See also, 
Littman et al., "Corrected CD 4 Sequence", Cell . 55 , 
p. 541 (1988). The putative signal sequence, 

25 extracellular domain, transmembrane domain and 

cytoplasmic domain are delineated by underscoring of 
the signal and transmembrane sequences. 

Construction o f a CD4/T.FA-3 Expression Vector 

In the following constructions, all 
30 restriction enzymes and other materials were used 
according to manufacturers' directions, unless 
otherwise specified. 
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DNA coding for PI-linked LFA-3 was obtained 
from plasmid P24, described in the aformentioned 
commonly assigned, copending United States patent 
application Ser. No. 237,309. An LB stab of E.coli 
5 cells carrying this plasmid was deposited in the in 
Vitro International, Inc. culture collection in 
Linthicum, Maryland, USA on July 22, 1988, under 
accession number IVI-10180. 

Plasmid P24 containing the full length cDNA 
10 coding for PI-linked LFA-3 (see Figure 2) was digested 
with restriction enzyme NotI (New England BioLabs, 
Beverly, * MA) and the 849-base pair (bp) fragment 
isolated by electrophoresis on 1% agar gel. The 849 bp 
fragment (P2 4 insert in Figure 1) was digested with 
15 restriction enzyme Bbvl (New England BioLabs) , and the 
205 bp fragment (N g45 -N 849 in Figure 1) was isolated by 
electrophoresis, then blunt-ended with DNA polymerase I 
(Klenow fragment, New England BioLabs) and ligated into 
the unique BstEII restriction site of expression vector 
20 BG391 (accession no. IVI-10151, In Vitro International, 
Inc.), which incorporates a gene for a recombinant 
soluble CD4 protein Corresponding to amino acids 1-325 
in Figure 3. See PCT publication no. WO 89/01940. 

The resulting recombinant DNA molecule, 
25 designated T4 /LFA-3 /AD, contained a DNA sequence coding 
for a polypeptide characterized by the N-terminal 267 
amino acids of CD4, followed directly by the 28 
C-terminal amino acids of PI-linked LFA-3. The coding 
sequence and deduced amino acid sequence of T4 /LFA-3 /AD 
30 is set forth in Figure 4. 

Plasmid T4 /LFA-3 /AD was linearized with Xmnl 
and used to cotransfect CHO (DHFR~) cells 
by the calcium phosphate method with the linearized 
StuI fragment of plasmid pAdD26, which carries the DHFR 
35 gene. See Kaufman and Sharp, Mol. Cell. Biol. . 2, 



- 21 - 



p. 1304 (1982) . Transfected cells were grown and 
selected in alpha (-) Modified Eagles Medium (MEM) 
(Gibco) . 

Surface expression of CD4 transfected cells 
5 was determined by Fluorescence Activated Cell Sorter 
(FACS) analysis: l x 10 6 transfected cells and control 
CHO cells were removed from tissue culture dishes by 
incubation with Hank's BSS buffer, 0.5 EDTA at 4°c for 
15 minutes. The detached cells were then pelleted, 
10 resuspended in 50 nl of PBN buffer (l x PBS, 0.5% BSA, 
0.1% sodium azide) and incubated with 100 fil of 
monoclonal antibody OKT4A (Becton Dickinson, 
Mountainview, CA) on ice for 45 minutes. The cells 
were pelleted by centrifugation and resuspended twice 
15 in PBN buffer. The cells were pelleted again and the 
cell pellets were resuspended in 100 nl of a 1:50 
dilution of FCI (Fluorescein Conjugated Affinity 
Purified F (ab») 2 Fragment Sheep Anti-Mouse IgG; 
Cappel Biomedical, Westchester, Pennsylvania, USA) in 
20 PBN buffer and incubated on ice for 30 minutes. The 
cells were layered onto a cushion of 300 Ml 100% fetal 
calf serum and then resuspended in 800 nl of 1 x PBS 
buffer and the intensity of fluorescence measured by 
FACS (Becton Dickinson, Mountainview, California, USA). 

By FACS analysis, 40 clones were obtained 
which were positive for surface expression of the 
chimeric CD4/LFA-3 polypeptide. Four positive clones 
were cultured in alpha (-) MEM and incubated with a 1:50 
dilution of PIPLC (a generous gift of M. Law, Columbia 
University, New York, New York, USA) in loo M l mem, at 
37-C for 30 minutes, i x 10 6 cells of each clone before 
and after PIPLC treatment were prepared for FACS 
analysis as above. The fluorescent intensities of the 
PIPLC treated clones were identical to the untreated 
clones, indicating that the CD4 protein was not 
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released and that expression of the T4 /LFA-3 /AD plasmid 
did not result in a chimeric polypeptide having a 
simple phosphatidylinositol anchor. 

A recombinant DNA molecule having a larger 
5 segment from the 3 1 end of LFA-3 cDNA was next 
constructed. Following the above procedure , a 
CD4 /LFA-3 hybrid DNA sequence was prepared which coded 
for the N-terminal 266 amino acids of the extracellular 
domain of CD4 and the C- terminal 51 amino acids of 
,10 Pi-linked LFA-3. The ligation introduced an isoleucine 
residue. This hybrid DNA molecule was designated 
T4/LFA-3/2. The coding sequence and deduced amino acid 
sequence of T4/LFA-3/2 is set forth in Figure 5. 

Plasmid T4 /LFA-3/ 2 was linearized with Xmn I 

15 as above and used to cotransfect 2 x 10 7 CHO (DHFR~) 
cells with Stul-linearized plasmid pAdD26 by the 
calcium phosphate method, as above. Transfected cells 
were grown and selected in alpha (-) MEM. 

- Sixteen resistant colonies were isolated, 

20 expanded and assayed for surface expression of a CD4 
protein by FACS as described above. Four clones that 
were strongly positive for CD4 surface expression, 
> T4/LFA-3/;2 #3, #8, #11 and #15, were assayed further 
for release of surface CD 4 after incubation with PIPLC, 

25 as described above. Two of the clones showed a 

decrease in surface CD4 after incubation with PIPLC, 
indicating that the extracellular portion of the CD4 
protein was attached to the cell extracellular portion 
of the CD 4 protein was attached to the cell surface via 

30 a phosphatidylinositol linkage (conferred by expression 
of the 3" phosphatidylinositol linkage signalling 
sequence excised from the P24 cDNA) ; two clones did not 
show decreased surface expression in CD4 after 
incijbation with PIPLC, indicating that the surface CD4 

35 of those particular clones was not Pi-linked. Figure 6 
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shows in the form of graphs the results of the PIPLC 
incubations on Clone T4 -LFA-3 -2 #11 (positive for 
Pi-linked CD4 expression) and Clone T4/LFA-3/2 #15 
(control, negative for Pi-linked expression) . 
5 Microorganisms and hybrid DNA molecules 

according to the invention are exemplified by a culture 
deposited in the culture collection of In Vitro 
International, Inc. 611 (P) Hammonds Ferry Road, 
Linthicum, MD, on April 5, 1989. The culture is 
10 identified as follows: 

Plasmid Culture Accession No 

T4/LFA-3/2 E.coli JA221 (pT4/LFA-3/2) IVI-10202 

While the products and methods of the present 
invention have been described herein with reference to 
15 particular DNA sequences and polypeptides, a wide range 
of additional embodiments are contemplated. For 
instance, although the foregoing description refers to 
a specific CD4/LFA-3 construct, many similar constructs 
are apparent from the disclosure, including, for 
20 example, longer or shorter portions of the DNA sequence 
encoding the extracellular domain of CD4, linked to the 
preferred LFA-3 PI linkage signalling sequence. Also, 
PI linkage signalling sequences derived from the 
Pi-linked form of LFA-3 which are longer or shorter 
25 than the specific sequence of the examples are 

contemplated, as are nonhomologous DNA sequences which, 
through the degeneracy of the genetic code, code on 
expression for the same operative C-terminal peptides 
as coded for by PI linkage signalling sequences derived 
30 directly from Pi-linked LFA-3. The present invention 
of course also contemplates chimeric polypeptides 
characterized by N-terminal extracellular regions 
derived from soluble proteins or the extracellular 
portions of proteins other than CD4. All such 
35 additional embodiments and obvious modifications are 
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within the intended scope of this invention as defined 
by the appended claims. 
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CLAIMS : 

1. A DNA sequence coding for a lymphocyte 
function-associated antigen 3 (LFA-3) 
phosphatidy linos itol linkage signalling sequence. 

2. A DNA sequence according to claim 1, 
selected from the group consisting of: 

5 ' -AGCAATCCATTATTTAATACAACATCATCAATCATTTTGACAACCTGT 
ATCCCAAGCAGCGGTCATTCAAGACACAGATATGCACTTATACCCATACCA 
TTAGCAGTAATTACy^CATGTATTGTGCTGTATATGAATGGTATGTATGCT 
TTT-3 1 , and DNA sequences which code on expression for 
a polypeptide coded for on expression by the foregoing 
DNA sequence. 

3. A hybrid DNA sequence comprising (a) a 
first DNA sequence coding for a secreted protein, a 
portion of a secreted protein or an extracellular 
region of a membrane protein, and, downstream of said 
first DNA sequence, (b) a second DNA sequence coding 
for a lymphocyte function-associated antigen 3 
phosphatidylinositol linkage signalling sequence 
operatively linked to said first DNA sequence such that 
expression of the hybrid DNA sequence results in a 
phosphatidylinositol-linked polypeptide. 

4. A hybrid DNA sequence according to 
claim 3, wherein the second DNA sequence is selected 
from the group consisting of: 

5 1 -AGCAATCCATTATTTAATACAAC^TCATCAATCATTTTGACAACCTGT 
ATCCCAAGC^GCGGTCATTCaLAGACACAGATATGCACTTATACCCATACCA 
TTAGCAGTAATTACAACATGTATTGTGCTGTATATGAATGGTATGTATGCT 
TTT-3 1 , and DNA sequences which code on expression for 
a polypeptide coded for on expression by the foregoing 
DNA sequence. 
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5. A hybrid DNA sequence according to claim 
3, wherein the first DNA sequence codes for a protein 
or portion of a protein selected from the group 
consisting of cell surface ligands, antibodies to cell 
surface antigens, hormones, cytotoxins, cell activation 
ligands, surface proteases, soluble receptors, and 
secreted or extrinsic proteins. 

6. A hybrid DNA sequence according to 
claim 5, wherein the first DNA sequence codes for an 
extracellular domain of CD4 . 

7. A recombinant DNA molecule comprising 
(a) a first DNA sequence coding for a secreted protein, 
a portion of a secreted protein or an extracellular 
region of a membrane protein, (b) a second DNA sequence 
downstream of said first DNA sequence coding for a 
lymphocyte function-associated antigen 3 
phosphatidylinositol linkage signalling sequence and 
operatively linked to said first DNA sequence such that 
expression of the first and second DNA sequences 
results in a phosphatidylinositol-linked polypeptide, 
and (c) an expression control sequence operatively 
linked to the first DNA sequence. 

8. A recombinant DNA molecule according to 
claim 7, wherein the second DNA sequence is selected 
from the group consisting of ; 

5 ■ -agcaatccattatttaatac^^catcatcaatca 
atcccaagc&gcggtcattcaa 

ttagcagtaattacaacatgtattgtgctgtatatgaatggtatgtatg^^ 
TTT-3 1 , and DNA sequences which code on expression for 
a polypeptide coded for on expression by the foregoing 
DNA sequence. 
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9. A recombinant DNA molecule according to 
claim 7 , wherein the first DNA sequence codes for a 
protein or portion of a protein selected from the group 
consisting of cell surface ligands, antibodies to cell 
surface antigens, hormones, cytotoxins, cell activation 
ligands, soluble receptors, surface proteases, and 
secreted or extrinsic proteins. 

10. A recombinant DNA molecule according to 
claim 7, wherein the first DNA sequence codes for an 
extracellular domain of CD4. 

11. A recombinant DNA molecule according to 
claim 7, wherein the expression control sequence is 
selected from the group consisting of early and late 
promoters of SV40 or adenovirus. 

12. A eukaryotic host cell culture 
transfected with a recombinant DNA molecule comprising 
(a) a first DNA sequence coding for a secreted protein, 
a portion of a secreted protein or an extracellular 
region of a membrane protein, (b) a second DNA sequence 
downstream of said first DNA sequence coding for a 
lymphocyte function-associated antigen 3 
phosphatidylinositol linkage signaling sequence and 
operatively linked to said first DNA sequence such that 
expression of the first and second DNA sequences 
results in a phosphat idy linos itol-linked polypeptide, 
and (c) an expression control sequence operatively 
linked to the first DNA sequence. 

13. A eukaryotic host cell culture according 
to claim 12, wherein the second DNA seguence is 
selected from the group consisting of: 
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5 1 -AGGAATCCATTAOTTAAT^ 
ATCCC^GCAGCGGTCATT 

TTAGCAGT^ATTACAACATGTATTGTGCTGTATATGAATGGTATGTATGCT 
TTT-3 1 f and DNA sequences which code on expression for 
a polypeptide coded for on expression by the foregoing 
DNA sequence. 

14. A eukaryotic host cell culture according 
to claim 12 , wherein the first DNA sequence codes for a 
protein or portion of a protein selected from the group 
consisting of cell surface ligands, antibodies to cell 
surface antigens , hormones, cytotoxins, cell activation 
ligands, soluble receptors, surface proteases, and 
sectreted or extrinsic proteins. 

15. A eukaryotic host cell culture according 
to claim 12, wherein the first DNA sequence codes for 
an extracellular domain of CD4. 

16. A eukaryotic host cell culture according 
to claim 12, wherein the expression control sequence is 
selected from the group consisting of early and late 
promoters of SV40 or adenovirus. 

17. A eukaryotic host cell culture according 
to claim 12, wherein the host cell is selected from the 
group consisting of CHO cells, Rl.l cells, COS cells, 
killer cells, helper T-cells, cytotoxic T- lymphocytes 
and macrophages . 

18. A polypeptide bearing a terminal 
phosphat idy linos itol structure encoded by a hybrid DNA 
sequence according to any one of claims 3-6. 
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19. A micelle consisting essentially of a 
multiplicity of polypeptides according to claim 18. 

20. A micelle according to claim 19, wherein 
the phosphatidylinositol structures are predominantly 
oriented on the outer surface of the micelle. 

21. A micelle according to claim 19 , wherein 
the phosphatidylinositol structures are predominantly 
oriented on the inner surface of the micelle. 

22. A method for producing a soluble form of 
a membrane protein comprising the steps of: 

(1) operatively linking to a DNA 
sequence coding for an extracellular portion of a 
membrane protein a DNA sequence coding for a lymphocyte 
function-associated antigen 3 phosphatidylinositol 
linkage signalling sequence, to form a hybrid DNA 
sequence ; 

(2) culturing a eukaryotic host cell 
transfected with an expression vector containing the 
hybrid DNA sequence, to produce extracellular 
polypeptides anchored to the cell membranes of the host 
cells by phosphatidylinositol linkages; 

(3) treating the cell culture with a 
reagent capable of disrupting phosphatidylinositol 
linkages ; and 

(4) collecting the soluble form of the 
polypeptide from the extracellular medium. 

23. A method according to claim 22, wherein 
the phosphatidylinositol linkage signalling sequence is 
selected from the group consisting of: 

5 1 -AGCAATCCATTATTTAATACAACATCATCAATCATTTTGACAACCTGT 
ATCCCAAGCAGCGGTCATTCAAGACACAGATATGCACTTATACCCATACCA 
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TTAGCAGTAATTACAACATGTATTGTGCTGTATATGAATGGTATGTATGCT 
TTT-3 ' , and DNA sequences which code on expression for 
a polypeptide coded for on expression by any of the 
foregoing DNA sequences. 

24. A method according to claim 22, wherein 
the reagent is selected from nitrous acid, 
phosphatidylinositol-specif ic phospholipase C or 
phosphatidylinositol-specif ic pospholipase D. 

25. A method according to claim 24, wherein 
the reagent is phosphatidylinositol-specif ic 
phospholipase C (PIPLC) . 

26. A method for producing target-directed 
host cells comprising the steps of: 

(1) isolating a DNA sequence coding for 
a ligand of a surface receptor on a target cell; 

(2) preparing a hybrid DNA sequence 
comprising the isolated DNA sequence and a DNA sequence 
coding for a lymphocyte function-associated antigen 3 
phosphat idy linos itol linkage signalling sequence 
operatively linked downstream of the isolated DNA 
sequence ; and 

(3) transfecting a host cell with an 
expression vector containing the hybrid DNA sequence, 
to obtain host cells bearing surface ligands to the 
target cell receptors. 

27. A method for screening a DNA expression 
library for the DNA of a secreted protein, comprising 
the steps of: 

(1) preparing a series of expression 
vectors, each vector including an expression control 
sequence, a DNA sequence coding for a lymphocyte 
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function-associated antigen 3 phosphat idy linos itol 
linkage signalling sequence, and a DNA isolate from a 
DNA library, said expression control sequence, DNA 
sequence and DNA isolate being oriented such that said 
DNA isolate is operatively linked to the expression 
control sequence and said DNA sequence is oriented so 
as to signal on expression phosphatidylinositol linking 
of any secreted polypeptide encoded by the DNA isolate; 

(2) transfecting eukaryotic host cells 
with the expression vectors prepared according to step 
(1) and culturing the transfected cells; and 

(3) exposing the transfected host cells 
to a substrate having one or more bound receptors or 
ligands for or antibodies against the polypeptide coded 
for by one or more of said DNA isolates. 

28. A method for producing micelles 
comprising a multiplicity of polypeptides bearing 
terminal phosphatidylinositol structures, comprising 
the steps of: 

(1) isolating a DNA sequence coding for 
a secreted protein, a portion of a secreted protein or 
an extracellular region of a membrane protein; 

(2) preparing a hybrid DNA sequence 
comprising the isolated DNA sequence and a DNA sequence 
coding for a lymphocyte function-associated antigen 3 
phosphatidylinositol linkage signalling sequence 
operatively linked downstream of the isolated DNA 
sequence ; 

(3) transfecting and culturing a host 
cell with an expression vector containing the hybrid 
DNA sequence, to obtain host cells bearing a 
phosphatidylinositol-linked form of the polypeptide 
encoded by the isolated DNA sequence on the host cell 
surface; 
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(4) lysing the transfectant cell 
culture with detergent, to obtain a dispersion 
containing monomeric polypeptides with intact 
phosphatidylinositol structures; and 

(5) removing the detergent by dialysis, 
optionally adding phospholipid, until micellular 
aggregates of the monomeric polypeptides with intact 
phosphatidylinositol structures are formed. 



1/iO 

COMPARISON OF HT16 AND P24 cDNAs 



HT16 
P24 

HT16 
P24 

HT16 
P24 

HT16 
P24 

HT16 
P24 

HT16 
P24 



MVAGS DAGRALGVL 
CGACGAGCCATGGTTGCTGGGAGCGACGCGGGGCGGGCCCTGGGGGTCCT 

IMIIMIIIIIIIIIIIllllMlllllllllillllllilllllllll 

CGACGAGCCATGGTTGCTGGGAGCGACGCGGGGCGGGCCCTGGOTGTCCT 
MVAGS DAGRALGVL 

SVVCLLHCFGFISCFSQ 
CAGCGTGGTCTGCCTGCTGCACTGCTTTGGTTTCATCAGCTGTTTTTCCC 100 

IIIIIIMIMMIIIIIIIIIIIIIIIIIIIIIIMllllliliMiii 

CAGCGTGGTCTGCCTGCTGCACTGCTTTGGTTTCATCAGCTGTTTTTCCC 100 
SVVCL LHCFGFISCFSQ 

QIYGVVYGNVTFHVPS 
AACAAATATATGGTGTTGTGTATGGGAATGTAACTTTCCATGTACCAAGC 

INIIIIIIIIMIIMIIIIIIIIIIIIlllMiliiiiiiiiiMiii 

AACAAATATATGGTGTTGTGTATGGGAATGTAACTTOCCATGTA^ 
QIYGVVYGNVTFHV PS 

NVPLKEVLWKKQKDKVA 
*^^T?T?? ^^^^^^^^^f^GGTCCTATGGAAAAAACAAAAGGATAAAGTTGC 200 

IMIIfllllMMIIIMIIIIMIIMlllilliiiiiiiiiiiiiii 

AATGTGCCTTTAAAAGAGGTCCTATGGAAAAAACAAAAGGATAAAGTTGC 200 
NVPLKEVLWKKQKDKVA 

ELENSEFRAFSSFK NRV 
AGAACTGGAAAATTCTGAATTCAGAGCTTTCTCATCTTTTAAAAATAGGG 

IMIIIIIIIIII M I II I M 1 1 1 M II Ml 1 1! II! MM ! II I ! I II I 

AGAACTGGAAAATTCTGAATTCAGAGCTTTC^ 
ELENSE FRAFSSFKNRV 

YLDTVSGSLTIYNLTS 

Ml III 1 1 II 1 1 Mil I II M M M II II 1 1 MM I MM 1 1 1 1 1 

TTTATTTAGACACTGTGTCAGGTAGCCTCACTATC 30 0 
YLDTVSGSLTIYNLTS 
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HT16 
P24 



2/iO 

SDED EYEMESPNITDTM 
TCAGATGAAGATGAGTATGAAATGGAATCGCCAAATATTACTGATACCAT 

MIIMIIIIIIIIIINIIMIIIIIIMIIIIIMIIIIIIIIIIIII 

TCAGATGAAGATGAGTATGAAATGGAATCGCCAAATATTACTGATACCAT 
SDEDEYEMESPNITDTM 



KFFLYVLESLPSPTLTC 
HT16 GAAGTTCTTTCTTTATGTGCTTGAGTCTCTTCCATCTCCCACACTAACTT 400 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
P24 GAAGTTCTTTCTTTATGTGCTTGAGTCTCTTCCATCTCCCACACTAACTT 400 

KFFLYVL E SLP SPTLT C 



HT16 
P24 



ALTN GSIEVQCMIPEH 
GTGCATTGACTAATGGAAGCATTGAAGTCCAATGCATGATACCAGAGCAT 

1IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GTGCATTGACTAATGGAAGCATTGAAGTCCAATGCATGATACCAGAGCAT 
ALT NGS IEVQCMIPEH 



HT16 
P24 



YNSHRGLIMYSWDCPME 
TACAACAGCCATCGAGGACTTATAATGTACTCATGGGATTGTCCTATGGA 500 

llllllllllllllllllllllllllllllllllllllllllllllllll 
TACAACAGCCATCGAGGACTTATAATGTACTCATGGGATTGTCCTATGGA 500 

YNSHRGLIMYSWDCPME 



HT16 
P24 



QCKRNSTS IY FKMENDL 
GCAATGTAAACGTAACTCAACCAGTATATATTTTAAGATGGAAAATGATC 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII1IIII 

GCAATGTAAACGTAACTCAACCAGTATATATTTTAAGATGGAAAATGATG 
QCKRNSTS IYFKMENDL 



HT16 
P24 

HT16 
P24 



PQKIQCTLSNPLFNTT 
TTCCACAAAAAATA(ZAGTGTACTCTTAGCAATCCATTATTTAATACAACA 600 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

TTCC!ACAAAAAATACAGTGTACTCTTAGCAATCCATTATTTAATACAACA 600 
PQKIQCTLSN PLFNTT 

SSIILTTCIPSSGHSRH 
TCATCZ^TCATTTTGACAACCTGTATCCCAAGCAGCGGTCATTCAAGACA 

llilllllllllllllllllllllllllllllllllllllllllllllll 
TCATCAATCATTTTGACAACCTGTATCCCAAGCAGCGGTCATTCAAGACA 

SSIILTTCIPSSGHSRH 



RYALIPIPLAVITTCIV 
HT16 CAGATATGCACTTATACCCATACCATTAGCAGTAATTACAACATGTATTG 700 

I II 1 1 1 1 M M i M II I II 1 1 1 1 1 11 1 1 1 1 1 M I M I M M 1 1 1 M 1 1 1 1 

P 2 4 CAGATATGCACTTATACCCATACCATTAGCAGTAATTACAACATGTATTG 700 

RYALIPIPLAVITTCIV 
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HT16 
P24 

HT16 

P24 

HT16 

P24 

HT16 

P24 

HT16 

HT16 
HT16 
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LYMNGILKCDRKPD RT 
TGCTGTATATGAATGGTATTCTGAAATGTGACAGAAAACCAGACAGAACC 

IIIIIMIIMIIIIIIII | | | | IN || Ml 

TGCTGTATATGAATGGTATGTATGCTTTTTAAAACAAAATAGTTTGAAAA 
LYMNGMYAF* 

N S N * 

AACTCCAATTGATTGGTAACAGAAGATGAAGACAACAGCATAACTAAATT 800 

I II I I I II I I II || 

CTTGCATTGTTTTCCAAAGGTCAGAAAATAGTTTAAGGATGAAAATAAAG 800 

ATTTTAAAAACTAAAAAGCCATCTGATTTCTCATTTGAGTATTACAATTT 

MM II I I | I I I || 

TTTGAAATTTTAGAC^TTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAGCG 

TTGAACAACTGTTGGAAATGTAACTTGAAGCAGCTGCTTTAAGAAGAAAT 900 
GCCGC gOO 

ACCCACTAACAAAGAACAAGCATTAGTTTTGGCTGTCATCAACTTATTAT 

ATGACTAGGTGCTTGCTTTTTTTGTCAGTAAATTGTTTTTACTGATGATG 1000 

TAGATACTTTTGTAAATAAATGTAAATATGTACACAAGTG 1Q4Q 



FIG. 1 1 cont.) 
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T4LFA-3AD amino acid and DNA sequence 



1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGC 
MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLe\iAl 

GCTCCTCCCAGCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAAAAAG 100 
aLeuLeuProAlaAlaThrGlnGlyLysLysValValLeuGlyLysLysG 

101 GGGATACAGTGGAACTGACCTGTACAGCTTCCCAGAAGAAGAGCATACAA 
lyAspThrValGluLeuThrCysThrAlaSerGlnLysLysSerlleGln 

• • ■ 

TTCCACTGGAAAAACTCCAACCAGATAAAGATTCTGGGAAATCAGGGCTC 200 
PheHisTiTpLysAsnSerAsnGlnlleLysIleLeuGlyAsnGlnGlySe 

• • 

201 CTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGCTGACTCAAGAA 
rPheLeuThrLysGlyProSerLysLenAsnAspArgAlaAspSerArgA 

GAAGCTTGTGGGACCAAGGAAACTTTCCCCTGATCATCAAGAATCTTAAG 300 
r gS erLeuTrpAs pGlnGlyAsnPheProLeu I le I leLy s As nLeuLy s 

301 ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGA 
IleGlnAspSerAspThrTyrlleCysGluValGluAspGlnLysGluGl 

« • • • • 

GGTGCAATTGCTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGC 400 
uValGlnLeuLeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuL 

• • • • * 

401 TTCAGGGGCAGAGCCTGACCCTGACCTTGGAGAGCCCCCCTGGTAGTAGC 
euGlnGlyGlnSerLeuThrLeuThrLeuGluSerProProGlySerSer 

■ « • • • 

CCCTCAGTGCAATGTAGGAGTCCAAGGGGTAAAAACATACAGGGGGGGAA 500 
ProSerValGlnCysArgSerProArgGlyLysAsnlleGlnGlyGlyLy 

• • • • « 

501 GACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGCACCTGGACAT 
sThrLeuSerValSerGlnLeuGluLeuGlnAspSerGlyThrTrpThrC 

• • • • • 

GCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATAGACATCGTG 600 
ysThrValLeuGlnAsnGlnLysLysValGluPheLysIleAspIleVal 

a • « • • 

601 GTGCTAGCTTTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGA 
ValLeuAlaPheGlnLysAlaSerSerlleValTyrLysLysGluGlyGl 

FIG. 4 
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• • • • • 

ACAGGTGGAGTTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGG 700 
uGlnValGluPheSerPheProLeuAlaPheThrValGluLysLeuThrG 

• • • • « 

GCAGTGGCGAGCTGTGGTGGCAGGCGGAGAGGGCTTCCTCCTCCAAGTCT 
lySerGlyGlyLeuTrpTrpGlnAlaGluArgAlaSerSerSerLysSer 



TGGATCACCTTTGACCTGAAGAACAAGGAAGTGTCTGTAAAACGGGTTAC 800 
TrpIleThrPheAspLeuLysAsnLysGluValSerValLysArqValTh 

• • • • • 

AAGACACAGATAT GCAC TTATACC CATAC CATTAGCAGTAATTACAACAT 
r Ac qHi sArgTvrAl aLeu! lePr o I leProLeuAl aVal I leThrThr C 

• • • • . 

GTATTGTGCTGTATATGAATGGTATGTATGCTTTTTAAAACAAAATAGTT 900 
vs IleValLeuTvrMetAsnGlvMetTvr AlaPhe 

• • • 

TGAAAACTTGCATTGTTTTCCAAAGGTCAGAAAATAGTTTAAGGATGAAA 

• • • . . 

ATAAAGTTTGAAATTTTAGACATTTGAAAAAAAA^U^AAiy^AAAAAAAAAA. 1000 

AAAGCGGCC 1009 



FIG. 4 Icont.) 
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